N=1:

rank | frequency | n-gram |
---|---|---|
1 | 466627 | -a |
2 | 383104 | -e |
3 | 266973 | -y |
4 | 264737 | -i |
5 | 225778 | -m |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 140995 | -ie |
2 | 116635 | -ch |
3 | 87560 | -em |
4 | 77994 | -ki |
5 | 75340 | -go |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 72032 | -ego |
2 | 68235 | -nie |
3 | 64002 | -ych |
4 | 37994 | -nia |
5 | 32900 | -ach |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 28670 | -nych |
2 | 26261 | -anie |
3 | 21750 | -iego |
4 | 20751 | -nego |
5 | 18010 | -wych |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 18801 | -kiego |
2 | 17113 | -owych |
3 | 13201 | -owego |
4 | 11853 | -skiej |
5 | 11851 | -wanie |

The tables show the most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings, except that the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
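
For readers without access to the database, the same counting can be sketched in Python. This is only an illustration of what the query computes, not the code used to produce the tables: the word list `WORDS` and the function name `top_suffixes` are hypothetical, and each list entry is counted once, in the way `COUNT(*)` counts rows of `words`. Ties are broken alphabetically here so the result is deterministic, which the SQL query does not guarantee.

```python
from collections import Counter

# Hypothetical toy word list standing in for the `words` table.
WORDS = ["dobrego", "nowego", "małego", "pisanie", "czytanie", "nowych", "domach"]


def top_suffixes(words, n, limit=5):
    """Return (rank, frequency, n-gram) rows for the most frequent endings of length n."""
    # word[-n:] behaves like MySQL's RIGHT(word, n): a word shorter than n
    # contributes the whole word.
    counts = Counter(word[-n:] for word in words)
    # Sort by descending frequency, then alphabetically to break ties.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [
        (rank, freq, "-" + suffix)
        for rank, (suffix, freq) in enumerate(ordered[:limit], start=1)
    ]


if __name__ == "__main__":
    # One small table per N, mirroring the layout rank | frequency | n-gram.
    for n in range(1, 6):
        print(f"N={n}:")
        for rank, freq, ngram in top_suffixes(WORDS, n):
            print(f"{rank} | {freq} | {ngram}")
```

Calling `top_suffixes(WORDS, 3)` corresponds to the N=3 query above; changing the second argument gives the other table sizes.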